Microarchitecture Characteristics and Implications of Alignment of Multiple Bioinformatics Sequences
Abstract
With the growth of the bioinformatics and computational biology industry, multiple sequence alignment (MSA) applications have become an important emerging workload. Despite the considerable recent attention given to MSA software design, there has been little quantitative understanding of how such applications perform on modern microprocessors and systems. In this paper, we analyze the performance and characteristics of MSA software from the perspective of processor microarchitecture. We use twelve popular MSA programs that employ a wide variety of alignment approaches. The basic workload characteristics, the phase behavior, and the efficiency of various microarchitecture features, such as the trace cache, out-of-order execution, caching, branch prediction, and speculative execution, are examined and analyzed on the Pentium 4 microarchitecture. Our major observations are: (1) Instruction footprints of MSA programs are typically small and can fit in the L1 instruction cache. The trace cache spends a high percentage of time in deliver mode. (2) Loads and stores account for 60% of the dynamic instructions executed. This indicates that further improving memory bandwidth will benefit the performance of MSA software. (3) Prefetching and a large L2 cache can efficiently handle the working sets of a majority of the studied benchmarks. Nevertheless, MSA software that uses an exact alignment method yields poor cache performance. The data TLB behavior largely depends on the alignment method used. (4) The studied MSA applications show large variation in dynamic branch frequency and mix. Indirect branches, calls, and returns can be predicted with high accuracy. The overall branch misprediction rate exceeds 5% on half of the examined benchmarks. (5) The IPC of the studied benchmarks ranges from 0.15 to 0.93. Overall, the processor speculatively executes 27% more instructions than it retires.
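The ratios quoted in the abstract (IPC, share of loads and stores, branch misprediction rate, and speculative overhead) are all simple quotients of hardware event counts. The following is a minimal sketch of that arithmetic only. The `CounterSample` type, its field names, and the sample numbers are hypothetical illustrations; they are not the paper's measurement tooling, Pentium 4 event names, or reported data.

```python
from dataclasses import dataclass


@dataclass
class CounterSample:
    """Hypothetical raw event counts for one program run (not from the paper)."""
    cycles: int
    instructions_retired: int
    instructions_executed: int  # includes speculative work that was later discarded
    loads_retired: int
    stores_retired: int
    branches_retired: int
    branches_mispredicted: int


def ipc(s: CounterSample) -> float:
    """Instructions retired per clock cycle."""
    return s.instructions_retired / s.cycles


def memory_fraction(s: CounterSample) -> float:
    """Share of retired instructions that are loads or stores."""
    return (s.loads_retired + s.stores_retired) / s.instructions_retired


def branch_misprediction_rate(s: CounterSample) -> float:
    """Mispredicted branches as a share of all retired branches."""
    return s.branches_mispredicted / s.branches_retired


def speculation_overhead(s: CounterSample) -> float:
    """Extra instructions executed beyond those retired, relative to retired."""
    return (s.instructions_executed - s.instructions_retired) / s.instructions_retired


# Made-up counts chosen only to exercise the formulas.
sample = CounterSample(
    cycles=2_000_000,
    instructions_retired=1_000_000,
    instructions_executed=1_270_000,
    loads_retired=400_000,
    stores_retired=200_000,
    branches_retired=100_000,
    branches_mispredicted=6_000,
)

if __name__ == "__main__":
    print(f"IPC: {ipc(sample):.2f}")
    print(f"Load/store share: {memory_fraction(sample):.0%}")
    print(f"Branch misprediction rate: {branch_misprediction_rate(sample):.1%}")
    print(f"Speculation overhead: {speculation_overhead(sample):.0%}")
```

Under this reading, "27% more instructions than it retires" corresponds to a speculation overhead of 0.27, that is, executed instructions equal to 1.27 times the retired count.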
Similar resources
Molecular and Bioinformatics Analysis of Allelic Diversity in IGFBP2 Gene Promoter in Indigenous Makuee and Lori-Bakhtiari Sheep Breeds
The aim of this study was to perform molecular and bioinformatics analysis of IGFBP2 gene promoter in association with some economic traits in indigenous Makuee (MS) and Lori-Bakhtiari (LB) breeds. DNA was extracted from blood samples of 120 MS and 200 LB and a 297 bp fragment from the upstream sequences of studied gene was amplified and genotyped by single-strand conformational polymo...
gpALIGNER: A Fast Algorithm for Global Pairwise Alignment of DNA Sequences
Bioinformatics, through the sequencing of the full genomes for many species, is increasingly relying on efficient global alignment tools exhibiting both high sensitivity and specificity. Many computational algorithms have been applied for solving the sequence alignment problem. Dynamic programming, statistical methods, approximation and heuristic algorithms are the most common methods appli...
An Evolutionary and Phylogenetic Study of the BMP15 Gene
DNA sequence data contains a wealth of biologically useful information. Recent innovations in DNA sequencing technology have greatly increased our capacity to determine massive amounts of nucleotide sequences. These sequences can be used to specify the characteristics of different regions, interpret the evolutionary relationships between categorized groups, likelihood of performing multiple com...
An Application of the ABS LX Algorithm to Multiple Sequence Alignment
We present an application of ABS algorithms for multiple sequence alignment (MSA). The Markov decision process (MDP) based model leads to a linear programming problem (LPP), whose solution is linked to a suggested alignment. The important features of our work include the facility of alignment of multiple sequences simultaneously and no limit for the length of the sequences. Our goal here is to ...
Bioinformatics Analysis of Upstream Region and Protein Structure of Fungal Phytase Gene
Phytase increases the bioavailability of phytate phosphorus in seed-based animal feeds and reduces the phosphorus pollution of animal waste. Since most animal feeds for pellets are heated up to 65-80 °C, the production of a thermostable structure for phytase can be useful. In this study, we sought to perform bioinformatics analysis of the upstream region and protein structure of fungal phytase ...
Publication date: 2006